Abstract—Cardiovascular diseases (CVD) refer to a variety of
ailments that affect the heart and have been the main cause of
mortality in recent decades. Predicting the possibility of
cardiac disease is a difficult task; automating it can help
treat the ailment early and efficiently. Prognosis of heart
disease based entirely on previous clinical records has been
deemed unreliable due to several factors. Non-invasive
strategies together with machine learning methods can be used
to classify those with cardiac problems efficiently. In this paper,
the Internet of Things (IoT) is used to interface input
parameters with suitable models for low-latency prediction of
heart ailments. Using the 2016 Cleveland Heart Disease dataset,
models are proposed that predict the disease with good
accuracy. Using data engineering along with machine learning
classifiers such as Naive Bayes, Decision Tree, Logistic Regression,
and Random Forest, the proposed work predicts the chance
of heart disease and classifies the patient's
risk level. Consensus modelling among the above-mentioned
algorithms has been performed, and an overall accuracy of 91.25%
is achieved.

Index Terms—Cardiovascular Disease, Machine learning, IoT, 
Google Fit. 


I. INTRODUCTION 


Over the last decade, cardiovascular diseases have been the prime
cause of death globally, taking approximately 17.9 million
lives each year [1]. Heart diseases increase healthcare
expenditures and lower wellness. According to the World Health
Organization (WHO), India loses billions of dollars due to
heart and cardiovascular diseases [2] [3] [4]. A range of
factors, including personal and professional behaviors as well as
hereditary susceptibility, contribute to heart disease. Smoking,
consumption of alcoholic beverages and caffeine, regular stress,
and physical inactivity are all common risk factors for heart
disease, as are health factors such as obesity, hypertension, high
blood cholesterol, and preexisting cardiac illnesses [5]. The
ability to diagnose heart disease rapidly and accurately
is essential for taking precautionary measures to avoid fatality. A
general overview of the steps used for the detection of CVD is
presented in Fig. 1. Correct diagnosis of heart
disease in patients is essential for reducing the associated risk of
severe heart problems and for improving the protection of
the heart [6]. Invasive techniques often lead to
imprecise diagnoses and to delays in diagnostic
results because of human error. Moreover, they are expensive,
computationally complex, and time-consuming to assess [7].


[Fig. 1, surviving box label: "Various Different Classifiers and Comparing their Evaluation Metrics"]


Fig. 1: Overview of steps in detection of CVD. 


II. RELATED WORK


Prediction of heart disease has been an active area of study.
Various machine learning and deep learning algorithms have
been applied in pursuit of higher accuracy. In
[4], Rajdhan et al. evaluated a wide range of machine learning
models that can be used to detect the presence of cardiac disease.
The evaluation metrics of Random Forest, Decision Tree,
Logistic Regression, and Naive Bayes classifiers
were compared in this study. In [7], Haq
et al. used seven well-known models, including logistic
regression, Decision Trees, and Artificial Neural Networks, and three
feature selection techniques (Relief, mRMR,
and LASSO) to select the most relevant features. K-fold
cross-validation was also used, and several assessment measures
were used to evaluate the performance of the classifiers. The best
accuracy, 89%, was obtained by logistic regression with the Relief
feature selection algorithm under 10-fold cross-validation. In
[8], Ramalingam et al. examined the impact of feature extraction
methods such as Principal Component Analysis (PCA) on a variety
of measures. Alternating decision trees performed
remarkably well when used with PCA, although decision trees
did not perform satisfactorily, probably due to overfitting.
Ensemble models did exceptionally well because they
employ numerous algorithms to overcome the problem of
overfitting. Models based on the Naive Bayes classifier were
computationally very fast and performed well. SVM
worked successfully in the great majority of situations. In [9], D.
Shah et al. described a variety of algorithms that can be employed
to forecast heart disease effectively. K-nearest neighbor, Naive
Bayes, and decision tree were used as data mining classification
approaches. The data was preprocessed before being fed to the
models. The algorithms presented were K-nearest neighbor (k = 7),
Naive Bayes, and random forest.


III. PROPOSED METHOD


The research consists of two parts: (a) prediction of cardiovascular
ailments using machine learning and neural network
algorithms, and (b) predicting whether a user is healthy enough based on
fitness tracker data. The aim of part (a) is to classify people based
on their habits to predict whether they could have heart disease soon.
Since this is a sensitive topic, several different machine learning
algorithms were employed and tested on the data to achieve the best
prediction results. The heart disease dataset has
been studied by various medical institutions to better understand the
various factors that affect the functioning of a person's heart. The
models are compared with each other, and the results are shown.
The series of events involves:


- Exploring the Framework 
- Pre-processing of Data 
- Machine Learning Algorithms 


The proposed work is the implementation of algorithms on
data acquired from a fitness tracker and smartwatch to accurately
predict the heart's condition.


A. Exploring the Framework 


The system, shown in Fig. 1, consists of two parts: (a) comparing
the different machine learning algorithms and
deploying the most suitable model, and (b) building a web
application with Django, HTML, CSS, JavaScript, and Firebase. The
system incorporates Google's science-based metric
"Heart Points". The system makes API calls with an
OAuth2 client to the Google Fit database using the Google
developer console. The metric thus received is used to decide whether a
person is healthy or not.
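
As a minimal sketch of how the received metric could drive the healthy/not-healthy decision, the snippet below sums the last seven daily Heart Points totals and compares them with a weekly goal. The goal of 150 points and the function name `meets_weekly_goal` are assumptions made for illustration; the paper does not state its decision rule.

```python
from typing import Sequence

WEEKLY_GOAL = 150  # assumed weekly Heart Points target; not stated in the paper


def meets_weekly_goal(daily_points: Sequence[float], goal: float = WEEKLY_GOAL) -> bool:
    """Return True when the last seven daily totals reach the goal."""
    last_week = list(daily_points)[-7:]
    return sum(last_week) >= goal


print(meets_weekly_goal([30, 20, 25, 10, 40, 15, 20]))  # 160 points in the week
```

Keeping the goal as a parameter makes it easy to try other thresholds without touching the rule itself.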

1) Dataset Description: The dataset used in this study is
combined from, and inspired by, various datasets found under the
Index of Heart Disease datasets in the UCI Machine
Learning Repository. The major dataset is taken from the web
repository of the University of California, Irvine. This dataset
consists of results of 918 patients who presented for angiography
at the Cleveland Clinic in Cleveland, Ohio. The dataset has 11
independent features and a classification label (as shown in Table I),
which is used to diagnose heart disease. The
classification label has binary values, where 0 represents
absence or a relatively low threat level and 1 represents higher
chances of disease. The data is partitioned into
training and testing splits in the proportion of 80% and 20%,
respectively. The full statistics of the dataset along with its
description can be found in [10]. The metric, along with some
other information, was stored in a vector. It is later subjected to
pre-processing for further refinement. Getting a bare minimum
of heart points periodically can help someone stay healthy
and free from heart disease. Fig. 2 represents the proposed
work structure, consisting of the ML/neural network structure.
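
The 80%/20% partition described above can be sketched as a seeded shuffle of row indices. The helper name, the seed, and the use of a plain (non-stratified) shuffle are assumptions; the paper does not say how the split was drawn.

```python
import random


def train_test_split_indices(n_samples: int, test_fraction: float = 0.2, seed: int = 0):
    """Shuffle row indices and cut them into train and test parts."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    n_test = round(n_samples * test_fraction)
    return indices[n_test:], indices[:n_test]


train_idx, test_idx = train_test_split_indices(918)
print(len(train_idx), len(test_idx))  # 734 184
```

With 918 records this yields 734 training rows and 184 test rows.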


TABLE I: Description of the features used. 


S.No. | Feature                           | Label    | Description                                                                       | Domain
1     | Age                               | AGE      | Age in years                                                                      | 29 < Age < 77
2     | Sex                               | SEX      | Male = 0, Female = 1                                                              | 1, 0
3     | Chest Pain Type                   | CP       | 1 = Atypical Angina, 2 = Typical Angina, 3 = Asymptomatic, 4 = Non-Anginal Pain   | 1, 2, 3, 4
4     | Resting Blood Pressure            | TRESTBPS | Blood pressure in mm Hg                                                           | 94 < rbp < 200
5     | Serum Cholesterol                 | SCH      | In mg/dl                                                                          | 120 < sch < 564
6     | Fasting Blood Sugar               | FBS      | > 120 mg/dl                                                                       | 1 = True, 0 = False
7     | Resting ECG Results               | RESTECG  | 0 = Normal, 1 = Having ST-T, 2 = Hypertrophy                                      | 0, 1, 2
8     | Max Heart Rate                    | THALACH  | -                                                                                 | 71-102
9     | Exercise Induced Angina           | EXANG    | 1 = Yes, 0 = No                                                                   | 1, 0
10    | Old Peak                          | OLDPEAK  | -                                                                                 | 0-6.2
11    | Slope of Peak Exercise ST Segment | SLOPE    | 1 = Up sloping, 2 = Flat, 3 = Down sloping                                        | 1, 2, 3
12    | Target                            | TARGET   | 0 = No, 1 = Yes                                                                   | 0, 1


This section explains the resources, the datasets used, and the 
working of the proposed system. 

2) Google FIT: The project involves the novel idea of using a
Google-based metric for classifying whether a person is living a
healthy life or not. The project uses the "Heart Points"
[11] or "Heart Minutes" metric, created by Google in
association with the American Heart Association's Centre for Health
Technology & Innovation [12]. The extraction is done with the
help of the Google Fit API and a few other technologies provided by
Google for a secure data connection. Since the
application uses a person's private data, an OAuth2 client has been
set up to address privacy and security concerns.
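
To make the extraction step concrete, the sketch below builds, without sending, a request for daily Heart Points totals. It assumes the Google Fit REST aggregate endpoint and the `com.google.heart_minutes` data type; these, the helper name, and the placeholder token are not given in the paper and should be checked against the official API documentation. The access token is assumed to come from the OAuth2 flow mentioned above.

```python
import json
import urllib.request
from datetime import datetime, timedelta, timezone

# Assumed endpoint of the Google Fit REST API; verify against the official docs.
AGGREGATE_URL = "https://www.googleapis.com/fitness/v1/users/me/dataset:aggregate"


def build_heart_points_request(access_token: str, end: datetime, days: int = 7):
    """Build (but do not send) a daily-bucket aggregate request for Heart Points."""
    start = end - timedelta(days=days)
    body = {
        "aggregateBy": [{"dataTypeName": "com.google.heart_minutes"}],
        "bucketByTime": {"durationMillis": 24 * 60 * 60 * 1000},
        "startTimeMillis": int(start.timestamp() * 1000),
        "endTimeMillis": int(end.timestamp() * 1000),
    }
    return urllib.request.Request(
        AGGREGATE_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {access_token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_heart_points_request(
    "ACCESS_TOKEN_PLACEHOLDER", datetime(2022, 1, 8, tzinfo=timezone.utc)
)
print(request.get_method(), request.full_url)
```

Separating request construction from sending keeps the sketch testable offline; an actual call would pass the object to `urllib.request.urlopen`.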
Fig. 2: Overview of the proposed framework.


B. Pre-processing 


Pre-processing of the dataset is necessary for visualization
and efficient machine learning classification. Cleaned and
processed data can lead to better training and validation of machine
learning algorithms. Pre-processing techniques such as
exploratory data analysis (EDA), handling of missing values, and
one-hot encoding of the categorical labels have been performed
for better results. The data for the next part of the project, i.e.,
the data from Google FIT, can be used directly [13]. The data
has been scaled as per the min-max scaling
algorithm using the sklearn library in Python, so that
all values are numeric and lie in the range of 0 to
1.
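
A minimal sketch of the scaling and encoding steps, assuming scikit-learn's `MinMaxScaler` and `OneHotEncoder`. The three toy rows and the choice of columns are illustrative and are not taken from the dataset.

```python
import numpy as np
from sklearn.preprocessing import MinMaxScaler, OneHotEncoder

# Toy rows: [age, resting blood pressure, chest pain type]; values are illustrative.
rows = np.array([
    [29, 94, 1],
    [53, 147, 3],
    [77, 200, 4],
])

numeric = rows[:, :2].astype(float)
categorical = rows[:, 2:]

scaled = MinMaxScaler().fit_transform(numeric)  # each column mapped to [0, 1]
encoded = OneHotEncoder().fit_transform(categorical).toarray()  # one column per category

features = np.hstack([scaled, encoded])
print(features.shape)  # (3, 5)
```

Min-max scaling maps each value x to (x - min) / (max - min), so the smallest value in a column becomes 0 and the largest becomes 1.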


C. Consensus of Classifiers 


To classify heart disease patients and
healthy people, machine learning algorithms and neural network
models are used. Since the target variable has only two values,
0 (Healthy) and 1 (Abnormal), binary classifiers are deployed
[14]. The classifiers used are Random Forest Classifier, Support
Vector Classifier (SVC), Artificial Neural Networks (ANN),
Naive Bayes, and Logistic Regression. A consensus among
these classifiers, trained on the same dataset, is drawn with the help of
Algorithm 1. In this algorithm, the dataset is represented as TD.
Initially the default label is set to Healthy; it turns to Abnormal
if the consensus result says so. C represents the classifiers
considered for the purpose, with |C| their total number. The individual
classifier models are explained in the subsequent
sections.
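
The consensus step can be sketched as a majority vote over the individual predictions. The stand-in classifiers below are hypothetical threshold rules used only to exercise the voting logic; they are not the trained models of this work.

```python
from collections import Counter
from typing import Callable, Sequence

HEALTHY, ABNORMAL = 0, 1


def consensus_label(classifiers: Sequence[Callable[[Sequence[float]], int]],
                    sample: Sequence[float]) -> int:
    """Collect one vote per classifier and return the majority label."""
    label = HEALTHY  # default label, as in the description above
    votes = [predict(sample) for predict in classifiers]
    if votes:
        label = Counter(votes).most_common(1)[0][0]
    return label


# Five stand-in classifiers (hypothetical threshold rules, not trained models).
stand_ins = [lambda x, t=t: int(x[0] > t) for t in (0.2, 0.4, 0.5, 0.6, 0.8)]
print(consensus_label(stand_ins, [0.55]))  # 3 of 5 vote Abnormal -> 1
```

With an odd number of binary classifiers, as here, a tie cannot occur.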

Algorithm 1 Consensus of classifiers (TD)
Require: TD ≠ ∅
Ensure: label = TRUE
1: Read the dataset TD and set default Label = Healthy
2: Initialize counter k = 0 and votes l = ∅    ▷ There are 5 classifiers considered
3: While k ≠ |C|
4:     l_k = Predict(C_k)
5:     l = l ∪ l_k
6:     Label = Majority(l)
7:     k = k + 1
8: End of While
9: Return Label


TABLE II: Performance metrics obtained for the models used.


Model               | Accuracy | Sensitivity | Specificity | ROC
Logistic Regression | 93 334%  | 82.30%      | 84.04%      | 83.99%
SVM                 | 86.23%   | 80.53%      | 90.18%      | 85%
Naive Bayes         | 79.43%   | 94.33%      | 75.69%      | 80.73%
ANN                 | 89.94%   | 86.98%      | 88.95%      | 84.74%
RF                  | 85.86%   | 90.65%      | 94.49%      | 84.63%
Consensus           | 91.25%   | 92.05%      | 94.85%      | 89.68%


1) Logistic Regression: Logistic regression [15] is used to
estimate the value of the result variable y when y is either 0
or 1, where 0 is the negative class and 1 is the positive class in a
binary classification problem. Standardization of the numerical
values of the dataset is required because the model is linear: if
the numerical values are not all on the same scale, the larger
values could influence the performance of the model. The
categorical data present is one-hot encoded. Categorical
data tends to form different patterns in the dataset, which could lead
to unexpected model behavior.
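
The role of standardization in the logistic regression model can be illustrated with a small sketch. The weights and bias are made-up values rather than fitted coefficients, and the helper names are hypothetical.

```python
import math
from statistics import mean, pstdev


def standardize(column):
    """Z-score a numeric column so that it has mean 0 and unit variance."""
    mu, sigma = mean(column), pstdev(column)
    return [(value - mu) / sigma for value in column]


def predict_proba(features, weights, bias):
    """Logistic model: sigmoid of the weighted feature sum."""
    z = sum(w * x for w, x in zip(weights, features)) + bias
    return 1.0 / (1.0 + math.exp(-z))


ages = standardize([29, 53, 77])
cholesterol = standardize([120, 342, 564])
# Weights and bias are made-up values for illustration, not fitted coefficients.
p = predict_proba([ages[2], cholesterol[2]], weights=[0.8, 0.5], bias=-0.1)
print(round(p, 3))
```

After standardization the largest age and the largest cholesterol value land on the same z-score, so neither feature dominates the weighted sum merely because of its units.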

2) Support Vector Machine: The proposed work uses a
linear SVM model with soft margins. The dataset is processed
in the same way as for the logistic regression approach [16].
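
For reference, the standard soft-margin objective of a linear SVM, 0.5 * ||w||^2 + C * sum_i max(0, 1 - y_i (w . x_i + b)), can be evaluated as in the sketch below. The value of C and the toy points are assumptions, since the paper does not report them; the 0/1 target would be mapped to -1/+1 for this formulation.

```python
def soft_margin_objective(w, b, X, y, C=1.0):
    """0.5 * ||w||^2 + C * sum of hinge losses; labels y must be -1 or +1."""
    margin_term = 0.5 * sum(wi * wi for wi in w)
    hinge = 0.0
    for xi, yi in zip(X, y):
        score = sum(wj * xj for wj, xj in zip(w, xi)) + b
        hinge += max(0.0, 1.0 - yi * score)  # zero once a point clears the margin
    return margin_term + C * hinge


X = [[2.0, 0.0], [0.5, 0.0], [-2.0, 0.0]]
y = [1, 1, -1]
print(soft_margin_objective([1.0, 0.0], 0.0, X, y))  # 0.5 + 0.5 = 1.0
```

Only the second point lies inside the margin and contributes a hinge loss; a larger C would penalize such violations more heavily.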
Fig. 3: Layers and neuron counts of the ANN
deployed for the classification.
3) Naive Bayes: The approach works best when attributes are
highly independent in nature [14]. This assumption seldom holds in
the real world; hence, the approach was expected to be the
worst performer.
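
The independence assumption can be made explicit with a small worked example: the posterior is proportional to the class prior times the product of per-feature likelihoods. All probabilities below are made up for illustration, and the function name is hypothetical.

```python
def naive_bayes_posterior(priors, likelihoods, observed):
    """Posterior over classes assuming features are independent given the class.

    priors: {class: P(class)}
    likelihoods: {class: [{value: P(value | class)} for each feature]}
    observed: one value per feature
    """
    scores = {}
    for cls, prior in priors.items():
        score = prior
        for table, value in zip(likelihoods[cls], observed):
            score *= table[value]  # independence: multiply per-feature terms
        scores[cls] = score
    total = sum(scores.values())
    return {cls: score / total for cls, score in scores.items()}


# Made-up probabilities for two binary features (e.g. FBS and EXANG).
priors = {0: 0.5, 1: 0.5}
likelihoods = {
    0: [{0: 0.8, 1: 0.2}, {0: 0.9, 1: 0.1}],
    1: [{0: 0.6, 1: 0.4}, {0: 0.3, 1: 0.7}],
}
posterior = naive_bayes_posterior(priors, likelihoods, observed=[1, 1])
print(round(posterior[1], 3))  # 0.933
```

When features are correlated, the product counts shared evidence more than once, which is why the assumption can hurt on real clinical data.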

Fig. 4: Confusion matrix and ROC pairs for LR, SVM, NB,
ANN, and RF in (a, b), (c, d), (e, f), (g, h), and (i, j),
respectively.

4) ANN Stacking XGBoost: An artificial neural network
stacked with the extreme gradient boosting (XGBoost) algorithm is considered
for performing the said classification. Simply put, the probabilistic
outputs from the ANN are fed to the XGBoost module for an
improved learning outcome. The ANN model consists of an input
layer with 11 features. It has two hidden layers with 6 and 8
neurons, respectively. Both have a dropout of 0.3 with ReLU as
the activation function. Finally, a single-neuron output layer, with
sigmoid as the activation function, gives the probability
of having a heart ailment. These probabilities are then
used as input for the XGBoost algorithm, which is stacked with
this ANN model. For this experiment, the data has been
randomly split into training, validation, and testing sets in the
ratio of 3:1:1 [17]. The stacked model is trained for 100
epochs. The detailed structure of the ANN portion is
shown in Fig. 3.
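
The shape of the ANN described above (11 inputs, hidden layers of 6 and 8 neurons, one sigmoid output) can be sketched as a plain forward pass. The weights here are random placeholders rather than trained values, dropout is omitted because it is only active during training, and the XGBoost stage that would consume the output probability is not shown.

```python
import math
import random

LAYER_SIZES = [11, 6, 8, 1]  # input features, two hidden layers, one output neuron


def init_layers(sizes, seed=0):
    """Random weights and zero biases; a trained model would learn these."""
    rng = random.Random(seed)
    layers = []
    for n_in, n_out in zip(sizes[:-1], sizes[1:]):
        weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]
        layers.append((weights, [0.0] * n_out))
    return layers


def forward(layers, x):
    """ReLU on hidden layers, sigmoid on the output; dropout is skipped at inference."""
    for index, (weights, biases) in enumerate(layers):
        z = [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]
        if index < len(layers) - 1:
            x = [max(0.0, value) for value in z]
        else:
            x = [1.0 / (1.0 + math.exp(-value)) for value in z]
    return x[0]


layers = init_layers(LAYER_SIZES)
n_params = sum(len(w) * len(w[0]) + len(b) for w, b in layers)
probability = forward(layers, [0.5] * 11)
print(n_params)  # 137 trainable parameters
```

The parameter count follows from (11 x 6 + 6) + (6 x 8 + 8) + (8 x 1 + 1) = 137, which shows how small the network is.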

5) Random Forest Classifier: Random Forest is a "bagging"
ensemble machine learning model that consists of various
decision trees. It uses the concepts of information gain and
entropy to make decisions [18]. Both Decision Trees and
Random Forests are non-linear, lazy algorithms. Standard scaling
of the numerical features is already present in the dataset this time, so the
dataset is fed as it is to the model. The categorical data values are
mapped into binary form so that the model can interpret them
properly [19]. Two hyper-parameters of the Random Forest algorithm
are tuned. The first is the "max_depth" parameter of the decision trees
in the forest. The second is "n_estimators", i.e., the number of trees
in the forest. Both parameters are rigorously tested across different
values using GridSearchCV.
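
A minimal sketch of the hyper-parameter search, assuming scikit-learn's `GridSearchCV` and `RandomForestClassifier`. The candidate values and the synthetic data are assumptions; the paper does not list the grid it searched.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

# Synthetic stand-in for the 11-feature dataset; the real data is not used here.
X, y = make_classification(n_samples=200, n_features=11, random_state=0)

# Candidate values are assumptions; the paper does not list its search grid.
param_grid = {"max_depth": [3, 5], "n_estimators": [25, 50]}

search = GridSearchCV(
    RandomForestClassifier(random_state=0),
    param_grid=param_grid,
    cv=5,                # 5-fold cross-validation
    scoring="accuracy",
)
search.fit(X, y)
print(search.best_params_)
```

Each of the four parameter combinations is scored on five folds, and the combination with the best mean accuracy is kept in `best_params_`.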


IV. RESULTS AND DISCUSSION 


This section discusses the overall performance of
the various classifiers in terms of accuracy, sensitivity, and
specificity. The results of each
classifier have been computed uniformly using
GridSearchCV. K-fold (k = 5) cross-validation is used to
measure the repeatability of the performance. Of the above-mentioned
models, ANN stacking XGBoost
performed the best with the data. Further, the consensus approach
raises the accuracy to 91.25%. The probabilistic
values used as input for the XGBoost algorithm showed a
comprehensive outcome. Coming to the second part of the
methodology, based on the research from the AHA's Centre for
Health Technology & Innovation, Heart Points data has been
taken from Google Fit, as it is a valid and scientific measure
for checking whether a person lives a healthy life. The value is
generated because the Google Fit app awards Heart Points
and Move Minutes during walks, runs, or bike
rides, using smartphone or watch sensors such as the
accelerometer and GPS. Table II lists the
performance metrics used for the
evaluation. As shown in Table II, the
consensus model works the best with the dataset. Sample
confusion matrices and ROC curves for the models are presented
pair-wise in Fig. 4, as
per their validation on the same sample set. A comparison
among competing schemes based on accuracy, sensitivity, and
specificity is also presented in Fig. 5. It is observed that the
consensus outperforms the competing schemes.
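
For clarity, the three reported metrics can be derived from a binary confusion matrix as sketched below. The counts are hypothetical and are not taken from the experiments.

```python
def binary_metrics(tp: int, fp: int, tn: int, fn: int) -> dict:
    """Accuracy, sensitivity (recall on positives) and specificity (recall on negatives)."""
    return {
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
    }


# Hypothetical counts for a 184-sample test split; not taken from the paper.
metrics = binary_metrics(tp=90, fp=8, tn=76, fn=10)
print({name: round(value, 4) for name, value in metrics.items()})
```

Sensitivity measures how many diseased patients are caught, while specificity measures how many healthy people are correctly cleared, so the two should be read together.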


A. Pros and Cons 


The sample distribution considered for validating the method is
not balanced. This makes some of the models perform
only moderately well, as in the case of logistic regression. Further, the loosely
related features affect the efficiency of Naive Bayes.
Non-linearity and laziness cannot be exploited by the uniform intervals
in the case of random forest. However, despite these drawbacks, the
consensus of these models compensates for them and yields a
performance that is quite satisfactory.

Fig. 5: Comparing the performance scores among competing
schemes.
V. CONCLUSION AND FUTURE SCOPE


Prediction of cardiovascular ailments through computational
models has been validated through a series of experiments on
suitable datasets. Along with this, the importance of the API
feed parameter "Heart Points" is recognized. A modified
consensus model is observed to outperform competing
state-of-the-art models in terms of performance measures. The
experiments carried out show an overall accuracy
of 91.25% along with a sensitivity of 92.05% in favor of the consensus
model. It can also be concluded that the Heart Points parameter can
be a viable data point that can serve as a measure for
classifying people's health habits. However, further analysis is to be
carried out as future work on ways to cope with high-dimensional
data and overfitting. Subsequent studies can
examine the best ensemble of algorithms to employ for a
specific kind of data.